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Abstract 

Every chordal graph G can be represented as the intersection graph of a collection of subtrees of a host tree, a so-called 
tree model of G. The leafage £{G) of a connected chordal graph G is the minimum number of leaves of the host tree 
of a tree model of G. The vertex leafage vi{G) is the smallest number k such that there exists a tree model of G in 
which every subtree has at most k leaves. The leafage is a polynomially computable parameter by the result of [1 1]. 
In this contribution, we study the vertex leafage. 

We prove for every fixed k > 3 that deciding whether the vertex leafage of a given chordal graph is at most k is 
NP-complete by proving a stronger result, namely that the problem is NP-complete on split graphs with vertex leafage 
of at most k+ 1. On the other hand, for chordal graphs of leafage at most £, we show that the vertex leafage can be 
calculated in time rP^^\ Finally, we prove that there exists a tree model that realizes both the leafage and the vertex 
leafage of G. Notably, for every path graph G, there exists a path model with ^(G) leaves in the host tree and it can 
be computed in 0(n'^) time. 
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1. Introduction 

In the following text, a graph is always finite, simple, undirected, and loopless. We write that a graph G = {V 
has vertex set V(G) and edge set E(G). We write uv for the edge {u,v) G E(G). We write Nq{v) to denote the 
neighbourhood of v in G, and write Nq [v] = Nq {v)U{v}. The degree of in G is denoted by deg^ {v) = \ Nq (v) \ . 
Where appropriate, we drop the index G, and write N{v), N[v], and deg(j7), respectively. We write G[X] to denote 
the subgraph of G induced by X C V{G), and write G — X for the graph G[y(G) \ X]. We write G — I'forG — {i'}. 
We say that X is a clique of G if G[X] is a complete graph, and X is an independent set of G if G[X] has no edges. 

A tree model of a graph G = [V , E) is a pair ,"7 = (T, {Tii}uev) where T is a tree, called a host tree, each T,, is 
a subtree of T, and a pair uv is in E if and only if V[Tu) ^V{T-i,) 7^ 0. In other words, 5^ consists of a host tree and 
a collection of its subtrees whose vertex intersection graph is G. 

A graph is chordal if it does not contain an induced cycle of length four or more. It is well-known [1,6, 20] that a 
graph is chordal if and only if it has a tree model. 

For a tree T, let ^(T) denote the set of its leaves, i.e., vertices of degree one. If T consists of a single node, we 
define ^{T) = (Z). In other words, we consider such a tree to have no leaves. 

The leafage of a chordal graph G, denoted by £{G), is defined as the smallest integer £ such that there exists a 
tree model of G whose host tree has £ leaves (see [14]). It is easy to see that £{G) = if and only if G is a complete 
graph, and otherwise £{G) > 2. Moreover the case £{G) < 2 corresponds precisely to interval graphs (intersection 
graphs of intervals of the real line) [4]. In this sense, the leafage of a chordal graph G measures how close G is to 
being an interval graph. 

In this paper, we study a similar parameter. 



*The main work on this project was done during the second author's visit at the University of Toronto in 2009 and later during the first author's 
visit at the Caesarea Rothschild Institute of the University of Haifa in 201 1 . Both visits were supported by Prof. Derek Cornell of the University of 
Toronto via his NSERC grant. The second author also gratefully acknowledges support from EPSRC, award EP/I01795X/1. 

Email addresses: chaplick@cs . tor onto . edu (Steven Chaplick), j . st ache @ Warwick .ac.uk (Juraj Stacho) 
Preprint submitted to Elsevier January 19, 2013 



Definition 1. For a chordal graph G = {V, E), the vertex leafage of G, denoted by vi{G), is the smallest integer k 
such that there exists a tree model (T, {Tu}u€v) of G where |^(T„) | < /c for aU m G V. 

In other words, the vertex leafage of G seeks a tree model of G where each of the subtrees (corresponding to the 
vertices of G) has at most k leaves and the value of k is smallest possible. 

As in the case of leafage, the vertex leafage is a natural parameter related to some subclasses of chordal graphs 
previously studied in the literature. We note that v£{G) > 2 unless G is a complete graph (in which case vi{G) = 0), 
and the case v£{G) < 2 corresponds precisely to the so-called path graphs (intersection graphs of paths in trees) [7] 
(see also [2, 13, 15, 17]). Thus, the vertex leafage of a chordal graph G can be seen as a way to measure how close 
G is to being a path graph. In [10], it is further observed that in 0{kn) time one can find: an optimal colouring, 
a maximum independent set, a maximum chque, and an optimal chque cover of an n-vertex chordal graph G with 
vertex leafage fc if a representation of G (a tree model reaUzing vertex leafage) is given. 

In [7] it is shown that path graphs can be recognized in polynomial time. Currently, the best known recognition 
algorithms for path graphs run in 0{nm) time [2, 17], where n — \V{G) \ and m — \E{G)\. In other words, for a 
graph G, testing whether vi{G) < 2 can be performed in 0{nm) time. 

Some other restrictions/variations on the standard tree model have also been studied. One such family of these 
variations is captured by the [h, s, t] graphs (introduced in [12]) defined as follows: G — {V,E) is an [h, s, t] graph 
if there is a tree model {T,{Tu }uev) of G such that the maximum degree of T is at most h, the maximum degree of 
each of {Tu}ueV is s, and uv is an edge of G if and only if T„ and T^, have at least t vertices in common. For more 
information on these graphs see [3, 9J. 

We summarize the results of our paper in the following theorems. 

Theorem 2. For every k > 3, it is NP-complete to decide, for a split graph G whose vertex leafage is at most fc + 1, 
if the vertex leafage ofG is at most fc. 

Theorem 3. For every i > 2, there exists an rp^^'l time algorithm that, given an n-vertex chordal graph G with 
1{G) < I, computes the vertex leafage ofG and construct a tree model ofG that realizes the vertex leafage ofG. 

Theorem 4. There exists an O(n^) time algorithm that, given an n-vertex chordal graph G — (V, E) and a tree 
model (T, {T^j^gy) ofG, computes a tree model (T*, {T^}uev) "/G such that 

(i) |if(r*)| < \^{Tu)\forallue V, 

(ii) \^iT*)\=£{G). 

Corollary 5. For every chordal graph G = {V, E), there exists a tree model {T*, {T*}ugy) such that 

(i) |if(r*)| < v£{G)forallu e V. 

(ii) \^{T*)\=i{G), 

In other words, such a tree model is optimal with respect to the leafage and also the vertex leafage ofG. 

This paper is structured as follows. First, in §2, we discuss some technical details related to tree models. After 
that, in §3, we prove for every fixed fc > 3 that deciding whether the vertex leafage of a chordal graph is at most fc is 
NP-complete (i.e., we prove Theorem 2). In light of theorem 2, in §4, we discuss calculating vertex leafage subject to 
bounded leafage. More specifically, for bounded leafage £, we show how to compute the vertex leafage in time n*-^'-'^' 
(i.e., we prove Theorem 3). Finally, in §5, we show that the vertex leafage and leafage of any chordal graph G can be 
realized simultaneously in a tree model of G (i.e., we prove Theorem 4 and Corollary 5). We close the paper in §6 
with a sunmiary and a discussion of possible extensions of this work. 

2. Minimal Tree Models and Clique Trees 

We need to discuss a particular type of tree models of chordal graphs. Most of this section is rather technical and 
a reader experienced with tree models can easily skip this part. However, we include it for completeness as some of 
the subtle transformations involved may not be clear to every reader. 
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Let G — {V, E) be a chordal graph. We say that two tree models ^ — (T, {Tu}uev) ^tnd ^' — {T', {Tl}uev) 
of G are isomorphic, and write ^ ~ if there exists an isomorphism (p between T and T' that induces an 
isomorphism between r„ and for all M e V, namely (j?(y(ru)) = V{T^). 

A tree model =^ — (T, {Tu}uev) of G is minimal if |l'(T) | is smallest possible among all tree models of G. A 
clique tree of G is a tree T whose nodes are the maximal cliques of G such that for all C, C' G ^(7"), every C" on 
the path between C and C' in T satisfies C" ^ C n C'. Every clique tree T of G defines a tree model of G, where 

= (T, {r„}„gy) and T„ is defined as T [{C G V (T) | m G C}] for all m G V. 

Fact 6. Let ^ — {T,{Tu\uev)^^^ tee model of G. Then the following statements are equivalent. 

(i) ^ is a minimal tree model of G. 

(ii) ^ ^ for some clique tree T of G. 

(iii) For all XY G E(T), contracting XY in T and all subtrees containing it yields a tree model of G' G. 

(iv) The mapping ip defined for X G V{T) as ^p{X) = {1/ G V | X G V'(T„)} is a bijection between the vertices of 
T and the maximal cliques of G. 

Proof. (i)=^(iii) and (ii)<S4>(iv) are clear, while (iii)^(iv)^(i) follow from the Helly property of subtrees. □ 

Note that (iv) in the above claim states, in other words, that the set of all vertices of G whose subtrees contain X 
is a maximal clique of G. In particular, for any tree model, the set of such vertices is always a chque of G, but it is not 
always necessarily a maximal chque. This is only true for minimal tree models. 

It follows from Fact 6(i)<;=>(iii) that every tree model (T, {Tu}u€v) of G can be transformed (by contracting some 
edges of the host tree and the subtrees) into a minimal tree model (T', {T^}uev)- Notably, as this transformation 
involves only contracting edges, it follows that this does not increase the number of leaves both in the host tree and 
the subtrees, namely \^{T') \ < |=^(T)| and \^{Tu)\ < \.^{T'u)\ for all m G V. 

This observation allows us to focus exclusively on minimal tree models. Namely, it shows that if there exists a 
tree model with minimum number of leaves in the host tree (subtrees), then there also is a minimal tree model with 
minimum number of leaves in the host tree (subtrees). Consequently, in the remainder of the paper, all tree models 
are assumed to be minimal tree models unless otherwise specified. 

Furthermore, using Fact 6(i)<;4>(ii), we shall view minimal tree models of G as tree models defined by clique trees 
of G. We shall switch between the two viewpoints as needed. 

3. Vertex Leafage is NP-complete 

In this section, we prove Theorem 2 stating that calculating the vertex leafage of a chordal graph is NP-complete. 
We describe a polynomial time reduction from the problem NOT-ALL-EQUAL-fc-SAT which is well-known to be 
NP-complete [5]. 

Proof of Theorem 2. The problem is clearly in NP as one can easily compute in polynomial time the number of 
leaves in subtrees of a given tree model. To prove NP-hardness, we show a reduction from NOT-ALL-EQUAL- 
fc-SAT. By standard arguments [16J, we may assume, without loss of generality, that the instances to this problem 
contain no repeated literals and no negated variables. Thus we can phrase the problem as follows. 
NOT-ALL-EQUAL-fc-SAT 

Instance T: a collection Ci, C2, Cm of fc-element subsets of {vi, . . .,v„}; 

Solution to I (if exists): a set S C {vi, . . . ,v„} such that each / G {1 . . . m} satisfies Cj\S ^ <Z) and S \ C^ 7^ 0. 

In addition, we may assume the following property of any instance I. 
(★) There are no distinct indices /, such that Vj+ G Cy whenever Vj G Cy. 

Indeed, if there exist i 7^ with c,+ G Cy whenever c, G Cy, then we replace T by another instance T"*" constructed 
from I by removing Vj and all clauses Cy that contain Vj. If there is a solution to I, then clearly S \ {vj} is a solution 
to I"*". Conversely, if S is a solution to I"*", then either S is a solution to I if I7,+ G S, or S U {z;;} is a solution to I if 
otherwise. 

Now, for the reduction, we consider an instance I satisfying (★) and construct a graph, denoted by Gj, as follows: 
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(i) the vertex set of Gj consists of n + m + 2 vertices: V(Gi) — {vi, . . . ,Vn,y\, ■ ■ ■ , ym, Z\, Z2}, 

(ii) the vertices {1/1, . . . , i/m } form a cHque, 

(iii) the vertices {vi, . . .,v„, Zi, Z2} form an independent set, 

(iv) each vertex Vj is adjacent to all vertices yj such that Vi G Cj, 

(v) the vertices Zj, Z2 are adjacent to each vertex of the clique {yi, . . . ,ym}- 

We observe that Gj is a spht graph with partition into clique {yi, . . .,i/m} and independent set {vi, . .., i?„,Zi,Z2}. 
We prove that the vertex leafage of Gj is: 

(a) at most fc + 1, and 

(b) is at most k if and only if there is a solution to X. 

To do this we analyze the cliques of Gj. This is easy, since Gj is a split graph; all its maximal cliques are 
formed by taking a vertex of the independent set with its neighbourhood. In particular, the maximal cUques of Gj are 
A. = {zi,yi,...,ym}, B = {z2,i/i, . . . ,i/m}, and Q, = {vi}U{yj \ Vi G Cy}foreachi G {l...n}. 

We first prove (a). Recall that {A, B, Qi, . ■ . ,Qn} is the set of all maximal cliques of Gj, and hence, the vertex 
set of every clique tree of Gj. Each of the vertices Zi, Z2, and Vj, for / G {1 . . . n}, belongs to exactly one of these 
cliques, namely A, B, and Qj, respectively. Also, each yj, for j G {1 . . . m}, belongs to exactly k + 2 cliques, namely 
A, B, and {Qi^, ■ ■ ■ , Q/^.} where Cj = {vj^, . . . , z?/^}. So, as fc > 3, every tree spanning these cliques has at most 
A: + 1 leaves. We thus conclude that in every clique tree of Gx, each subtree corresponding to a vertex of Gj has at 
most fc + 1 leaves. In other words, any clique tree of Gj certifies that v£(Gx) <k + l which proves (a). 

We now prove (b). Let S be a solution to I. Construct a tree T with vertex set {A, B, Qi, . ■ .,Qn} and edge set 
{AB} U {AQi I Vi G S} U {BQi \ i ^ S}. Let us verify that T is a clique tree of Gj. Its vertex set is the set of all 
maximal cUques of Gj. For distinct /, /+ G {1 . . . n}, the path between Q, and Q;+ contains A or B or both, and no 
other vertex. Note that Q; H Q,+ C {yj, . . . ,y^} — ACi B. This verifies the path between Q, and Q,+ . Similarly, 
the path between Q, and A or B additionally contains only A or B and we have Q, H A = Qj O B which verifies this 
path. That exhausts all paths in T and thus confirms that T is indeed a clique tree of Gj. 

Let ,5j = (T, {Tv}„^v{Gx)) model corresponding to T. We analyze its subtrees. First, we consider 

the subtree T^. where / G {1 . . .n}. As in (a), we observe that the vertex Vi only belongs to one clique of Gj, 
namely Q;. Thus |V(Ti,;)| = 1 implying |jSf(Ti,;)| = by our convention. Similarly, the vertices Zi and Z2 
each belong to only one clique, A and B respectively, and we have |^(Tzj)| = \^{Tz2)\ = 0. It remains to 
consider Ty^. for / G {1 ... m}. The vertex yj belongs to the cliques A, B, and k distinct cliques Q/^, . . . , Qj^. where 
Cj — {vi^,.. - /Vij^}. The cliques Q/j, . . •, Q;^. are leaves of Ty. as they are leaves of T. However, neither A nor 
B is a leaf of Ty.. Indeed, since S is a solution to I, there are indices p,r G {1 . . .k} such that Vj^ G S and 
Vj^ ^ S. Hence, by construction, T contains edges AQj^ and BQi^. So, Ty. contains these edges as well as the 
edge AB. Thus both A and B have at least two neighbours in Ty. and are therefore not leaves of Ty.. Consequently, 
\^{Ty.) \ — I {Qfj, . . . , Qi^}\ — k which imphes vl{Gx) < as certified by the tree model ^j. 

Conversely, suppose that vl{Gx) < k. Then there exists a clique tree T of Gj such that the corresponding model 
J^j — (T, {Tv}veV(Gi)) satisfies | (Ti,)| < k for all v G V{Gx)- We analyze the structure of T. First, we observe 
that AB must be an edge of T. If otherwise, the path between A and B in T contains some chque Q;, / G {1 . . . n). 
As T is a clique tree, we conclude {yj ■ . .ym} ~ ■Ar\B C- Qi — {vj} U {yj \ Vi G Cj}. But then Vj belongs to each 
Cj, G {1 . . . m}, and since n>k>3, this contradicts (*). Similarly, we show that each Q,-, / G {1 . . . n} is a leaf 
of T. If otherwise, some Q, has at least two neighbours in T. These cannot be A, B as this would imply a triangle in 
T, since AB is an edge of T. Thus Q, is adjacent to Q,+ for some f+ G {1 . . . n}. As T is a tree, we have that either 
Q,+ hes on the path from A to Q,, or Q/ lies on the path from A to Q,+ . By symmetry, we may assume the former. 
Thus, since T is a chque tree, we conclude {yj \ Vj G Cj} — AOQi C Q;+ = {vi+} U {yy 1 G Cj}. So Vi+ G Cj 
whenever c, G Cj, contradicting (★). 

Now, we are ready to construct a set S C {vi, . . .,v„} as follows: for each i G {1 . . . n}, we put Vj in S if 
AQi is an edge of T. We show that S is a solution to I. If not, there exists G {1 . . . m} such that either S 3 Cj 
or S n Cj = 0. We look at the subtree Ty. corresponding to the vertex yj. Recall that yj belongs to cliques A, B, 
and k cUques Qj^, . . . , Q,j, where C^ = {vi^, . . ., Vj^^}. The cliques Q,j, . . . , Qj^ are leaves of Tyj because they are 
leaves of T (as proved above). If S C Cj, we have, by construction, that A is the unique neighbour of each of the 
chques Qjj, . . . , Q,j, in T. Consequently, none of the cliques Q,j, . . . , Q;^ is adjacent to B in T. This shows that B is 
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only adjacent to A in Ty., and hence, is a leaf. But then |^(Ty^.) | — | {Qjj, . . . , Q;^, B}\ — k + 1, contradicting our 
assumption about T. Similarly, if S n Cy = 0, the cliques Q/^, . . . , Q,^ are only adjacent to B and not to A, in which 
case, A is a leaf of Ty^ leading to the same contradiction. 

Therefore, S must indeed be a solution to I and that concludes the proof. □ 

4. Vertex Leafage Parameterized by Leafage 

In this section, we discuss calculating vertex leafage in chordal graphs of bounded leafage. Namely, we prove 
Theorem 3, that is, for a fixed i, we demonstrate how to calculate the vertex leafage of an n-vertex chordal graph G 
with i{G) < iin polynomial time, namely, in time n^^^\ We do this by enumerating chque trees of G with respect 
to high (> 3) degree nodes. The enumeration is based on the observation that the number of high-degree nodes in a 
tree is directly related to the number of leaves. This goes as follows. 

For a tree T, let J^{T) denote the set of nodes of T of degree > 3, and let <^(T) denote the set of edges of T 
incident to the nodes mJf{T). Further, let n,- denoted the number of nodes of degree i in T. Then 

W |J^(T)| = E"^ < \^(T)\ < X](i-2)n; = 2|E(T)| -2|y(T)| +ni = \^{T)\-2 

i>3 i>3 

In particular, if |.if(r)| is bounded, then so is \J^{T)\ and |(o'(r)|. This will become useful later. 

Recall that the vertex set of every clique tree of G is the set of all maximal cliques of G. Notably, all clique trees 
have the same vertex set. Let '^{G) denote the chque graph of G, i.e., the graph whose nodes are the maximal cliques 
of G and where two nodes are adjacent if and only if the corresponding maximal cliques intersect. It is well-known 
[8, 18] that every clique tree of G is a spanning tree of ^(G). 

Our algorithm is based on the following lemma. 

Lemma 7. There is an O(m^) time algorithm that, given an n-vertex chordal graph G and a set F C E(^(G)), 
decides if there exists a clique tree T of G with S'(T) — F and constructs such a tree if one exists. 

Proof. We describe an algorithm for the problem as follows. 
Algorithm 1: 
Input: A chordal graph G and a set F C E('^(G)). 

Output: A chque tree T of G with ^{T) —F,or report that no such tree exists. 

1 Construct a graph G' as follows: 

y(G') = V{G) U {i;^ I e e F} 

E(G') = E(G) U {uve I e = CC' , h G C U C'} U {veV,i | e n e' 7^ 0} 

2 if G' is chordal then 

3 Construct a clique tree T' of G' with minimum number of leaves. 

4 Construct a tree T from T' by renaming each node C' G V{T') to C' fl V{G) 

5 if T is a clique tree of G and S{T) — F then 

6 I return T 

7 return "no such tree exists" 

We now prove correctness of the above algorithm. For simplicity, we shall refer to any clique tree T with ^(T) — 
F as a "solution". First, observe that if the algorithm returns the tree T in hne 6, then this is indeed a solution. This 
proves that if there is no solution, the algorithm provides the correct answer in line 7. 

Thus, for the rest of the proof, we may assume that a solution exists. Namely we shall assume there is a clique 
tree T* of G satisfying S{T*) — F. For every maximal chque C of G, define (p{C) = C U {ve | C G e}. 

In the following claim, we discuss the properties of the graph G' constructed in line 1 . 

(7 ) G' is chordal, satisfies i{G') < |^(T* ) |, and (p is a bijection between the maximal cliques of G and G' . 

To prove the claim, we construct a minimal tree model of G' as follows. Let ^* = {F* ,{Fu}ugv(g)) the 
minimal tree model of G that is defined by the clique tree T*, namely T* = T[{C G V{F*) \ u G C}]. For each 
edgee = CC' G F, define T*^ = T*[{C,C'}]. Finally,let ^+ = {T*,m}ueV{G) U {TDeeF}). 
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It is easy to verify that S^^ is a tree model of G'. In particular, each subtree in the collection is a connected 
subgraph of T*. This follows from the fact that T* is a clique tree and that F = SiT*) C E(T*). Further, for each 
edge e = CC in F, we see that the subtree T*^ intersects only subtrees T* where C or C' is in 1/ (T* ), i.e., those where 
M e C U C. Moreover, T*^ only intersects subtrees T^, where C or C' is in V'(T^,), i.e., those where e n e' 7^ 0. 
This corresponds precisely to the definition of G'. 

Thus, we conclude that G' is indeed a chordal graph, and i{G') < | Jf(T*) | as ^+ is a particular tree model of 
G' and T* is its host tree. Morever, we see that S^^ is actually a minimal tree model of G'. Indeed, if there were 
a tree model of G' with less than |V(r*)| nodes in its host tree, then by removing subtrees corresponding to the 
vertices {v^ | e G F} we would obtain a tree model of G whose host tree has less than |y(T*) | nodes. But this would 
contradict the minimality of S^f* . 

This implies, by Fact 6(ii), that there exists a clique tree T+ of G' that defines i.e., — Namely, 
there is an isomorphism between r+ and the host tree T* of ^+ where each node C G V{T*) corresponds to the set 
of all vertices of G' whose subtrees contain C, i.e., the set {u G V(G) | C G V(T„)} U {ve \ C G V{Ty^)} which 
is exactly (p{C). In other words, y(T+) — {(p{C) \ C G y(T*)}, and consequently, cp constitutes an isomorphism 
between T* and T+. As one is a clique tree of G and the other a clique tree of G', we conclude that (j9 is a bijection 
between the maximal cliques of G and G' . This proves (1). 

This proves that the test in Line 2 succeds. Now, consider the trees T' and T constructed in line 3 and 4. Notably, 
T is a clique tree of G' with \^{T') \ = ^(G'). 

(2 j T ;s fl clique tree of G. 

Recall that T is obtained from T' by renaming each node C' of T' to C' fl ^(G). Moreover, by (1), the mapping <p 
is a bijection between the maximal cUques of G and G' . Namely, for each C' G V(T), the set C = (J5^^(C') is a 
maximal clique of G. Therefore, we can write 

C n v{G) = epic) n y(G) = (c u {u, | c g e}) n y(G) = c = <p~^(c'). 

This proves that the vertex set of T is precisely the set of maximal cliques of G, and cp is an isomorphism between 
T and T', by the construction of T. To see that T is indeed a cUque tree of G, it remains to prove the "connectivity 
condition" for T. Namely, consider nodes Ci, C2 G y(T) and a node C3 on the path in T between Ci and €2- Since 
<p is an isomorphism between T and T', we have <p(Cj) G V{T') for / = 1,2,3 and cpiC^) hes on the path in T' 
between f{Ci) and f{C2). Thus, we conclude cpiC^) 2 <p(Ci) fl <p(C2) because T' is a clique tree. So we write 

C3 = (^(Cs) n y(G) D (p(Ci) n f{C2) n y(G) = Ci n C2. tms proves (2). 

This proves that T is a clique tree of G. Notably, as T* is also a clique tree of G, we have that both T and T* have 
the same vertex set, i.e., y(r) = V{T*). We now look at the edges of T. 

(3) F C E(r) 

Consider an edge e — CC' G F, and recall the definition of (p and the claim (1). From this it follows that (p{C) and 
(p{C') are the only maximal cliques of G' that contain Ve- As cp(C) and (p{C') are also nodes of T' which is a clique 
tree of G', we conclude that every maximal clique on the path in T' between <p(C) and (p{C') also contains Ve. But, 
as Ve is in no other maximal clique of G', this is only possible if <p(C) and cp{C') are adjacent in T'. Consequently, C 
and C' are adjacent in T, namely e G E{T). This proves (3). 

(4) Jif{T*) C (T) and each C G Jf'(T*) iarii/?ei Nr*(C) C Nt(C). 

Consider C G Jif{T*), namely C is a node of T* with at least three neighbours in T*. Then, by the definition of 
(r*), aU edges incident to C in T* belong to ^(T*). As (?(r*) = F and F C E(T) by (3), the edges incident 
to C in r* are also edges of T. In other words, every neighbour of C in T* is a neighbour of C in T, namely 
Nt{C) 2 Ny* (C). Thus C has at least three neighbours in T implying C G (T). This proves (4). 

(5) Jf{T) = (r*) anJ ^{T) = <^'(r*). 

By (3), we conclude Jf{T) 2 J^(r*). For the converse, we calculate using (1) and (★) as follows. 

^(G0<mr)|=2+ ^ (deg^.(C)-2) <2+ £ ( deg^(C) - 2) = |jSf (T)| = ^(G') 

CeJf(T*) C6jr(T) 
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Note that the second inequahty follows from (4) and the fact that degj-{C) > 3 for all C G (T), while the last 

equality is by i{G') = \^{T') \ and the fact that T and T' is isomorphic. 

It follows that the inequalities in the above formula are, in fact, equalities. Therefore, using (4), we conclude that 

Jf{T) = (T*) and every C G Jf{T*) satisfies Nt(C) = Nt*{C). To see this, recall that each C G Jf{T*) 

contributes to the sum on the right at least as much as to the sum on the left, since Nj{C) ^ Nj* (C) by (4). Further, 

every C E Jf{T) has a positive contribution to the sum on the right as degj(C) > 3 by the definition of Jf{T). 

Thus, since the two sums are equal, the only possibiUty is that Jf{T) — Jif{T*) and that each C G Jif{T*) satisfies 

Nt{C) = Nt*{C) as claimed. 

To conclude the proof, recall that S'{T), resp. ^(r*), is the set of edges of T, resp. T*, incident to the nodes in 
(r),resp. Jf{T*). AsJ^{T) = (T*) and each C G J^{T) = J^{T*) is incident to the same set of edges in 

T and T* for it satisfies Nt(C) = Nt* (C), we conclude that /{T) = f?(T*). This proves (5). 

This and (2) prove that T is indeed a solution, namely T is a clique tree of G with S'{T) — #(T*) — F. Hence, 
the test in Line 5 succeds and the algorithm correctly retum a solution in Line 6. 

That concludes the proof of correctness of the algorithm. To address the complexity, let n = \V{G) \ as usual. 
First, we note that we may assume that F contains at most n — 1 edges as no chque tree of G has more than n edges. 
If this is not so, we can safely report that no solution exists. Thus, as G' has |V(G)| + \F\ = 0{n) vertices, we 
conclude that step 3 takes O(n^) time using the algorithm of [14]. All other steps clearly take at most O(n^) time. 
Notably, in step 2 we use a linear time algorithm from [19]. 

Thus the total complexity is O(n^) as promised. That concludes the proof. □ 

Finally, we are ready to prove Theorem 3. 

Proof of Theorem 3. Let G be a chordal graph with £{G) < £. By Corollary 5 (proven in §5), there exists a tree 
model of G that simultaneously minimizes both the leafage and the vertex leafage. By the remarks in §2, there is also a 
clique tree of G with this property; let T* denote this clique tree. Note that \^{T*) \ = i{G) and |^(T*) | < vl{G) 
for all u G V{G) where T* = T* [{C G V{T*) 1 m G C}] . 

We show that it suffices to know the set (f ( T* ) to find a tree model that minimizes the vertex leafage. 

{6} If T is a clique tree ofG with S'{T) — S'{T*), then T minimizes the vertex leafage. 

Consider M G V{G). We need to show that |if(r„)| < vi{G) where T„ = r[{C G y(T) | m G C}]. 

First, we observe that ^{T) ^ ^{T*) implies ^(T) = ^(T*) and each C G ^(T) = ^(T*) has the same 
neighbourhood in both T and T*, i.e., Nt{C) = Nt*{C). Next, we remark that if a node has degree > 3 in Tu, 
then it also has degree > 3 in T, since T„ is an subgraph of T. In other words, we have J(f{Tu) C J^f(T). Further, 
we observe that each C G V{Tu) satisfies Nj^{C) — Nj{C) D V{Ti,), since Tu is an induced subgraph of T. By 
the same token, Nr*(C) = Nt*{C) n V{T*) for each C G V{T*). Finally, we note that V{Tu) = V{T*), since 
V{T) = V{T*). Thus, for each C G (T„), we can write 

Nt^c) = Nt{C) n v{Tu) = Nt*{C) n v{t*) = Nt*{c). 

This impUes C G Jf{T*) and degj^ (C) = degj, (C) for aU C G =^(T„). Thus, we calculate by (*). 

|^(T„)|=2+ ^ (deg^„(C)-2)<2+ ^ (deg^.(C) -2) = |^(T*)| < z;^(G) 

ceJf(Tu) ceje{Ti) 

For the inequality to hold, also note that degj., (C) > 3 for each C G M'{T*), by definition. This proves (6). 

This claim allows us to finally formulate our algorithm. We need to introduce additional of notation. Let F C 

E('^(G)). If there exists a clique tree T with (5?(T) = F, then define vif — max^gy^Qj \^{Tu)\ where T„ — 
T[{C G V{T) I u G C}] . If such a tree does not exist, define vip = +oo. Observe that vig-^jt-j < vl{G). 

Our algorithm tries all possible sets F C "^(G) of size at most ^ — 2 as candidates for # (T*) and chooses one 
that that minimizes vif. If F^pf is this set, the algorithm outputs a chque tree Topt of G with <^{Topt) — Fopt- 

We claim that this algorithm correctly finds a clique tree of G that minimizes the vertex leafage. By (★), we 
observe that <^{T*) < \^{T*) \ - 2 < £ - 2. Thus, the algorithm must, at some point, consider as F the set (?(!*). 
For this F, we have vip — vi^^j»^ < vi{G). By the minimality of Fopt, we conclude vip^^^ < vi^^j*^ < v£{G). 
Hence, vip^^^ < oo and so the tree Topt exists. Moreover, vip > vi{G) for all sets F, by the definition of vi{G). 
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Thus, we conclude c^f„p, = v£{G) and consequently by (6), Tgpt is a clique tree of G that minimizes the vertex 
leafage. This proves the correctness of the algorithm. 

Finally, let us analyze the complexity. Let n = \V{G) \ as usual. Recall that G has at most n maximal cliques. 
Thus there are at most rp- edges in 't^(G), and hence, at most n^^~^ choices for the set F. For each choice of F, we 
use Lemma 7 to find a clique tree T with (#'(T) = F if it exists. This takes 0{n^) for each F, including the calculation 
of v£p. This yields, altogether, running time 0{rfi^~^) = rP^^'^ as promised. □ 

We have shown how to calculate vertex leafage in polynomial time when the input graph has bounded leafage. It 
remains open whether this problem is fixed parameter tractable (FPT) with respect to leafage. 



5. Vertex Leafage with Optimum Leafage 

In this section, we prove Theorem 4 and Corollary 5. Namely, we demonstrate that the algorithm from [11], 
solving the leafage problem, satisfies the claim of Theorem 4. This algorithm, given a chordal graph G, outputs a 
clique tree of G with minimum possible number of leaves. This is done by starting from an arbitrary clique tree T of 
G, and iteratively decreasing the number of leaves of T as long as possible. 

We observe (and formally prove later in this section) that this process has the additional property that it never 
increases the number of leaves in the subtrees of the tree model 3/j defined by T. In other words, if T* is the clique 
tree resulting from this process, then = satisfies the claim of Theorem 4. This will imply that if the starting 
clique tree T reaUzes the vertex leafage of G, then ^7* = ^j* satisfies the claim of Corollary 5. 

For the proof of the above, we need to explain the inner workings of the algorithm from [11]. This algorithm, in 
place of clique trees, operates on the so-called token assignments defined as follows. 

For a chordal graph G, a token assignment of G is a function t that assigns to every maximal clique C of G, a 
multiset t(C) of subsets of C. We use the word token for the members of t(C). Note that the same subset may appear 
in t(C) many times. We focus on special token assignment that arise from clique trees. 

The token assignment defined by a clique tree T of G, and denoted by ej, assigns to every maximal clique C of 
G, the multiset £t{C) = {C H C' | CC' G E(r)}. In other words, ^t(C) consists of the intersections of C with its 
neighbours in T. A token assignment t is realizable if there is a clique tree T of G such that t = ej. 

(See Figure 1 for an illustration of these concepts.) 






Figure 1: a) Example chordal graph G. h) clique tree T of G, c) token assignment T — Ej. 



Notice that the token assignment t = £7 contains all the information needed to determine the number of leaves in 
T and also the number of leaves in the subtrees of the corresponding model We summarize this as follows. 

Lemma 8. Let G be a chordal graph, let T be a clique tree of G, and let = (T, {T'uSu'eM^G)) denote the tree 
model of G definedby T. Let T = £j, and define Tj,(C) = {S | S G t(C),m G S} for each u G V{G). Then 



• degj{C) = \T{C)\for all C E V{T), and 

• degj (C) = |ti,(C) I/or all u E V{G) and all C E V(T„). 

Consequently if(T) = |c |t(C)| = l| and^{Tu) = |c |t„(C)| = l^forallu E V{G). 
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In particular, while there can be multiple clique trees defining the same token assignment, these clique trees will have 
the same sets of leaves and consequently we do not need to distinguish them from one another In other words, it 
suffices to maintain that the token assignment we consider corresponds to some clique tree of G. This can be tested 
easily by applying four particular conditions as described in [1 1]. As we do not use this test here directly, we omit 
further details. (For more, see [11, Theorem 6].) 

Now, we are finally ready to explain the main steps of the algorithm from [11]. The algorithm is given a chordal 
graph G and a clique tree T of G. It starts by constructing the token assignment t = £j. Then it proceeds iteratively. 
During each iteration step, a current token assignment t is examined to determine if there exists a different token 
assignment corresponding to a clique tree with fewer leaves. This is done by checking for an augmenting path in t, 
which is a specific sequence of token moves (see definitions below). If an augmenting path exists, we pick the shortest 
such path and exchange tokens along the path. This results in a new token assignment t that corresponds to a clique 
tree with fewer leaves. If no augmenting path exists, we arrive at an optimal solution (i.e., a token assignment whose 
corresponding clique trees all have £{G) leaves) and we output this solution. We summarize the above procedure as 
Algorithm 2. Below we provide the missing definitions. 

Let G be a chordal graph and t be a token assignment of G. A token move is an ordered triple (Ci, C2, S) where 
Ci, C2 are maximal cliques of G and S G t{Ci). For a token move (Ci, C2, S), we write t (Ci, C2, S) to denote the 
token assignment t' that is the result of moving S from t(Ci) to t(C2). Namely', we have t'(Ci) = t(Ci) \ {S} 
and t'(C2) = t(C2) U {S}, while t'(C) = t(C) for all other C ^ {Ci, C2}. 

A sequence of token moves {C\, C2, Si), (C2, C3, S2), . . ., (C^t-l/ Sk-i) where fc > 1 is an augmenting path 
of T if |t(C;(-)| = 1 and each; G {1 . . .k — 1} satisfies 



(i) T -ir (C;, Ci+i, Sj) is a realizable token assignment^, and (ii) |t(C;) 




if/ = l 
otherwise 



See Figure 2 for an example of an augmenting path of a token assignment t and its application to t. 




Figure 2: a) token assignment T,b) augmenting patli {abc,adf,a), {adf,cdk, d) - directed edges, c) T after applying tlie path. 

It is easy to see that the application of an augmenting path decreases the number of leaves in the resulting to- 
ken assignment. This, however, does not guarantee that the resulting assignment corresponds to a clique tree of G. 
Fortunately, it can be proved that a shortest augmenting path has this property, and moreover, there always exists an 
augmenting path unless t corresponds to an optimal clique tree. The details can be found in [11]. We only remark the 
following invariant which is maintaned throughout the algorithm. 

Lemma 9. [11] In line 2 of Algorithm 2, the variable t is a realizable token assignment. 
After this introduction, we are ready to prove Theorem 4. 

Proof of Theorem 4. We prove the theorem by showing that each application of an augmenting path in Algorithm 2 
does not increase the number of leaves in the subtrees of the corresponding tree model. 



'Note that as both t(Ci) and t'(Ci) are multisets, to obtain t'(Ci) we only remove one instance of S from t(Ci) in case S appeal's in t(Ci) 
several times. This is consistent with the semantics of the set difference for multisets, 
^i.e., it corresponds to a clique tree of G. 
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Algorithm 2: Leafage( G, T) 



Input: A chordal graph G, and a clique tree T of G. 
Output: A cUque tree T* of G with \^{T*) \ = i{G). 

1 InitiaUze T £7- /* initialize the token assignment with the given clique tree. */ 

2 while there exists an augmenting path of t do 

3 Let (Ci, C2, Si), ... , (Cj._i, Cfc, Sfc_i) be a shortest augmenting path of t 

4 for all i from 1 to fc — 1 do 

5 I T ^ T-^ (C,-,C/+i,Si) 

6 return T* where £r* = T. 



In other words, let t be the token assignment considered at the start of some iteration (Lines 2-5) of Algorithm 2, 
and let [C\, C2, Si), . . ., (Cjt_i, Cjt, Sjt_i) be the shortest augmenting path of t considered in this iteration (Line 3). 
Let t' denote the value of t after applying the token moves of this path (Lines 4-5). 

By Lemma 9, both t and t' are realizable token assignments of G. In other words, there exist clique trees T 
and T' of G such that T — ej and t' — Sji. Let ^ = {^mIi/gVCG)) — {'^''{'^u}uev{G)) the 

corresponding tree models of G. In other words, for each u G V(G), we have T„ = T[{C e V{T) \ u e C}] and 

— T' [{C e y (r') | u G C}] . Moreover, just like in Lemma 8, we define for each u E V{G) and each maximal 
cUqueCof G, the sets T„(C) = {S | S G t(C),m G S} andT;;(C) = {S | S G t'(C),m G S}. 

Now, to prove the theorem, it suffices to demonstrate that \^{Tu)\ > |,5^(r,' ) | for every u G V{G). Consider 
u G V{G) and define two sequences of integers a\,. . .,a^ and h\,. . . ,hj^ where a, = |Tm(C;)| and — |t^(C,)| 
for all / G {1, . . .,k}. Note that Tm(C) = t^(C) for all C {Ci, . . . , C^}, and by Lemma 8, we have ^{Tu) — 
{C I |t„(C)| = 1} andJ5f(T;;) = {C | |t;;(C)| = 1}. This impUes the following. 

|if(T„)| - l^rai = \{Ci I |t„(Q)| = l}| -\{Ci I |t^(CO| = i}| = |{/ I Ui = 1}| - |{z I hi = 1} 

In other words, this boils down to showing that {/ | = 1} does not have more elements than | fl; = 1}. 

Recall that, by the definition of the augmenting path, |T(Cjt)| = 1 and |t(Cj)| = 2 for all / G {2. . .fc — 1}. 
Notably, since the path is shortest, Ci,...,Cj; are distinct maximal cliques of G. Thus, as t„(C) C t(C) for 
all C, we conclude that < 1 and fl; < 2 for all z G {1. . .k — 1}. Further, note that |T'(Ci-)| — 2 while 
\r'{Ci) I = |t(C;) I = 2 for all / G {2 ... - 1}. In other words, we have bi < 2 for all / G {2 . . . k}. 

We shall use the following two claims to show that | {/ | = 1} | < | {z | a, = 1} | . 

(7) If hi = 1, then a; > 1. 

Consider / G {1 . . .fc} such that hi — 1. First, we show that fl; > 1. Suppose that a, — 0. Since hi — 1, we 

have by Lemma 8 that 1 = bj — |t',(C,)| = degj/ (C,). In other words, C; is a leaf of T[^, and thus contains 

at least 2 vertices. Recall that V(T„) — V{Tl), and note that — Ui = |tu(Cj)| — degj.^(Cj) by Lemma 8. 
This means that C,- is a vertex of T„ with no neighbour in Tu- This is clearly impossible, since Tu is coimected and 
|y(r„)| = |y(r;i;)| > 2. Thus we must conclude that a, > 1. This proves (7). 

(8) Ifhi — 1 and Ui > 2, then there exists j > i such that — 1, h^ — 2, and Uy — hrfor allr & {/ + 1, . . . , / — 1}. 
To see this, first recall the construction of r' from t by moving the tokens Si, ... , Sj._i as follows. 

' T(Q)\{S;} iff = l 

t'(CO = I (t(CO \ {Si}) U {S,_i} if 1< / < fc 
, t(Q) U{Si_i} ifi-fc 

Also recall that fl/ = |t„(C,')| = |{S | S G t(C;),m G S}| and fc,- = |t/,(C/)| = |{S | S G t'(C/),m G S}|. 
From these two facts we conclude the following relationship between the values of a/ and bj (1 < i < k). 
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ax — 1 



if M e Si 
if M ^ Si 




a, 



a 



a 



a 



-1 
+ 1 



if M e Sj U S;_i 
if M € Sj \ S,_i 
if M e S;_i \ Si 

if M S;_l U Sj 




«/c + l 



if M e S^-i 
if M ^ Sfc_i 



Now, for the proof of (8), consider i G {1 . . . fc} such that hi = 1 and a,- > 2. By (★), we have |b, — fl, ! < 1 and 
thus fl; = 2. Further, i < k since > flfc by (*), but bi — 1 < 2 = a^. Moreover, u G S, since i < A: and bi — aj — 1 
by (*). We let / be the largest in {f + 1, . . . ,A: + 1} such that = h for each r G {/ + 1, . . — 1}. 

First, we observe that m G Sr for each r G {i, . . . ,j — 2}. Indeed, if otherwise, we let r be the smallest index in 
{/, — 2} with u ^ Sr- As we just argued, we have u G S/, and so r > f. Therefore, u G S;-_i by the minimality 
of r. But then fc^- = a^- + 1 by (*), since 1 < f < r < ;' — 1 < fc, a contradiction. 

This also implies that j < k. Indeed, if / = fc + 1, then / < / — 2 = fc — 1 since / < k. Thus u G Sj_2 — Sjt_i 
which yields bt — a^ + lby (★). However, G {i + 1, . . . , / — 1} and so b^ — a^ by the choice of /. 

We can now also conclude that u G Sy_i. Indeed, if i = j — 1, then we use the fact that u G S;. Otherwise, 
j < y — 2 in which case u G Sy_2 as argued above, and we conclude u G S^_i by (★), since l<i<j — l<k. 

Finally, we consider the value of First, suppose that / = k. Then b^ — at + 1, since u G Sj_i = Sj;_i. We 
recall that fl)t < 1 and so G {1,2}. If b^ = 1, we have flj; > 1 by (7), but then a^ > b^ = a^ + 1 > aj^, ?i 
contradiction. So, we must conclude b^ = 2 and a^ = 1. Thus, as = k, we have bj — 2, = 1, and = for 
all r G {/ + 1, — 1} as required. Thus we may assume that / < k. By the maximaUty of /, we have aj ^ bj. 
Also, u G Sj^i and 1 </<;'< fc. So by (*) we conclude that bj = aj + 1. We recall that bj < 2 as / > 1. Thus 
bj G {1, 2} as fly > 0. Again, if bj = 1, we conclude aj > 1 by (7) in which case fly > bj > aj, a contradiction. Thus 
bj = 2, fly — 1, and ar = by for all r G {f + 1, . . . , / — 1}, as required. This proves (8). 

We are now ready to conclude the proof. Denote A = {/ 1 a, = 1} and B — {i\bi — 1}. We show that |B| < \A\ 
which will imply the present theorem as argued above the claim (7). 

For each i G B, if fl/ = 1, we define (p{i) — i; otherwise, we define (p{i) — j where ; is the index obtained by 
applying (8) for i; note that fly — 1 and bj — 2. It follows that <p is a mapping from B to A. We show that q> is, in fact, 
an injective mapping. Suppose otherwise, and let /, z'"*" be distinct elements of B be such that (p[i) = (p{i^). Recall 
that bi = bi+ = 1 and note that i < cp{i) and f+ < (p{i~^)- If i = (p{i), then < (p{i^) = (p{i) = i implying 

< (p{i^) as i and f+ are distinct. So fl;+ Ihy the definition of cp, and hence b^(^i+-^ = 2 as cpii^) was obtained 
by applying (8) for /+. But then 1 = bi = b^(^i-j = b^i^i^-^ = 2, a contradiction. Thus we must conclude that / < (p{i) 
and, by symmetry, also < <p{i'^). Now, without loss of generality, assume i < /+. Since /+ < ^{i'^), we must 
have fl,+ 7^ 1 by the definition of (p. However, = 1 as i+ G B, and hence, 7^ bi+. Recall that the choice of 
(p{i) using (8) for i guarantees that fl^ = br for all r G {/ + 1, . . . , q>{i) — 1}. In particular, i < z+ < f{i~^) — (p{i) 
and so fl;+ — bi+ which is a contradition. This verifies that f is indeed an injective mapping from B to A, which 
yields |B| < \A\. 

This concludes the proof of Theorem 4. □ 
6. Concluding Remarks 

In this paper we have studied the vertex leafage of chordal graphs. Specifically, a chordal graph G = {V,E) has 
vertex leafage k when it has a tree model (T, {Tu}uev) such that each subtree T„ has at most k leaves. We have 
shown that, for every fixed > 3, it is NP-complete to decide if a split graph G has vertex leafage at most k even 
when G is known to have vertex leafage at most k+1. Additionally, we have demonstrated an n'-'^^^ algorithm to 
compute the vertex leafage of a chordal graph whose leafage is bounded by i. It remains open whether the vertex 
leafage is FPT with respect to leafage (or any other graph parameter). 

Finally, we have shown that every chordal graph G has a tree model which simultaneously realizes G's leafage 
and vertex leafage. In proving this final result we have also shown that, for every path graph G, there exists a path 
model with i{G) leaves in the host tree and that such a path model can be computed in O(n^) time. 
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